home *** CD-ROM | disk | FTP | other *** search
/ IRIX Base Documentation 2001 May / SGI IRIX Base Documentation 2001 May.iso / usr / share / catman / a_man / cat1 / wchrtbl.z / wchrtbl
Encoding:
Text File  |  1998-10-20  |  33.7 KB  |  594 lines

  1.  
  2.  
  3.  
  4. wwwwcccchhhhrrrrttttbbbbllll((((1111MMMM))))                                                        wwwwcccchhhhrrrrttttbbbbllll((((1111MMMM))))
  5.  
  6.  
  7.  
  8. NNNNAAAAMMMMEEEE
  9.      _wwww_cccc_hhhh_rrrr_tttt_bbbb_llll - generate character classification and conversion tables for
  10.      ASCII and supplementary code sets
  11.  
  12. SSSSYYYYNNNNOOOOPPPPSSSSIIIISSSS
  13.      _wwww_cccc_hhhh_rrrr_tttt_bbbb_llll [_f_i_l_e]
  14.  
  15. DDDDEEEESSSSCCCCRRRRIIIIPPPPTTTTIIIIOOOONNNN
  16.      _wwww_cccc_hhhh_rrrr_tttt_bbbb_llll creates tables containing information on character
  17.      classification, character conversion, character set width, and numeric
  18.      editing.  The first table is a multi-byte-sized array encoded such that a
  19.      table lookup can be used to determine the character classification of a
  20.      character, convert a character [see _cccc_tttt_yyyy_pppp_eeee(3C) and _wwww_cccc_tttt_yyyy_pppp_eeee(3W)], and find
  21.      the byte and screen width of a character in one of the supplementary code
  22.      sets.  The size of the array is (2*257*4) + 7 bytes:  257*4 bytes are
  23.      required for the 32-bit code set character classification table, 514*2
  24.      bytes for the upper- to lowercase and lower- to uppercase 16-bit
  25.      conversion table, and 7 bytes for character set width information.
  26.  
  27.      The second table contains information about the format of non-monetary
  28.      numeric quantities: the first byte specifies the decimal delimiter; the
  29.      second byte specifies the thousands delimiter; and the remaining bytes
  30.      comprise a null-terminated string indicating the grouping (each element
  31.      of the string is taken as an integer that indicates the number of digits
  32.      that comprise the current group in a formatted non-monetary numeric
  33.      quantity).
  34.  
  35.      If supplementary code sets are specified, additional variable sized
  36.      tables are generated for multibyte character classification and
  37.      conversion.
  38.  
  39.      _wwww_cccc_hhhh_rrrr_tttt_bbbb_llll reads the user-defined character classification and conversion
  40.      information from _f_i_l_e and creates three output files in the current
  41.      directory.  One output file, _wwww_cccc_tttt_yyyy_pppp_eeee_...._cccc (a C language source file),
  42.      contains the variable sized array generated from processing the
  43.      information from _f_i_l_e.  You should review the content of _wwww_cccc_tttt_yyyy_pppp_eeee_...._cccc to
  44.      verify that the array is set up as you had planned.  The first 257*4
  45.      bytes of the array in _wwww_cccc_tttt_yyyy_pppp_eeee_...._cccc are used for storing 32-bit character
  46.      classification for each of 257 single byte characters.  The characters
  47.      used for initializing these bytes of the array represent character
  48.      classifications that are defined in _cccc_tttt_yyyy_pppp_eeee_...._hhhh; for example, ______LLLL means a
  49.      character is lower case and ______SSSS|______BBBB means the character is both a spacing
  50.      character and a blank.  The second 257*4 bytes of the array are used for
  51.      character conversion with 514 elements consisting of 16-bit each.  These
  52.      bytes of the array are initialized so that characters for which you do
  53.      not provide conversion information will be converted to themselves.  When
  54.      you do provide conversion information, the first value of the pair is
  55.      stored where the second one would be stored normally, and vice versa.
  56.      For example, if you provide _<<<<_0000_xxxx_4444_1111 _0000_xxxx_6666_1111_>>>>, then _0000_xxxx_6666_1111 is stored where _0000_xxxx_4444_1111
  57.      would be stored normally, and _0000_xxxx_6666_1111 is stored where _0000_xxxx_4444_1111 would be stored
  58.      normally.  The last 7 bytes are used for character width information.  Up
  59.      to three supplementary code sets can be specified.
  60.  
  61.  
  62.  
  63.                                                                         PPPPaaaaggggeeee 1111
  64.  
  65.  
  66.  
  67.  
  68.  
  69.  
  70. wwwwcccchhhhrrrrttttbbbbllll((((1111MMMM))))                                                        wwwwcccchhhhrrrrttttbbbbllll((((1111MMMM))))
  71.  
  72.  
  73.  
  74.      For supplementary code sets, there are three sets of tables.  The first
  75.      set is three pointer arrays which point to supplementary code set
  76.      information tables.  If the corresponding supplementary code set
  77.      information is not specified, the contents of the pointers are zeros.
  78.      The second one is a set of three supplementary code set information
  79.      tables.  Each table contains minimum and maximum code values to be
  80.      classified and converted, and also contains pointers to character
  81.      classification and conversion tables.  If there is no corresponding
  82.      table, the contents of the pointers are zeros.  The last one is a set of
  83.      character classification and conversion tables which contain the same
  84.      information as the single byte table except that the codes are
  85.      represented as process codes and the table size is variable.  The
  86.      characters used for initializing values of the character classification
  87.      table represent character classifications that are defined in _cccc_tttt_yyyy_pppp_eeee_...._hhhh and
  88.      _wwww_cccc_tttt_yyyy_pppp_eeee_...._hhhh.  ______EEEE_1111 through ______EEEE_8888 are for international use and ______EEEE_9999 through ______EEEE_2222_4444
  89.      are for language-dependent use.
  90.  
  91.      The second output file (a data file) contains the same information, but
  92.      is structured for efficient use by the character classification and
  93.      conversion routines [see _cccc_tttt_yyyy_pppp_eeee(3C) and _wwww_cccc_tttt_yyyy_pppp_eeee(3W)].  The name of this
  94.      output file is the value of the character classification _LLLL_CCCC______CCCC_TTTT_YYYY_PPPP_EEEE read in
  95.      from _f_i_l_e.  This output file must be copied to the
  96.      _////_uuuu_ssss_rrrr_////_llll_iiii_bbbb_////_llll_oooo_cccc_aaaa_llll_eeee_////_l_o_c_a_l_e_////_LLLL_CCCC______CCCC_TTTT_YYYY_PPPP_EEEE file by someone who is super-user or a
  97.      member of group _bbbb_iiii_nnnn.  This file must be readable by user, group, and
  98.      other; no other permissions should be set.  To use the character
  99.      classification and conversion tables on this file, set the _LLLL_CCCC______CCCC_TTTT_YYYY_PPPP_EEEE
  100.      category of _ssss_eeee_tttt_llll_oooo_cccc_aaaa_llll_eeee [see _ssss_eeee_tttt_llll_oooo_cccc_aaaa_llll_eeee(3C)] appropriately.
  101.  
  102.      The third output file (a data file) is created only if numeric editing
  103.      information is specified in the input file.  The name of the file is the
  104.      value of the character classification _LLLL_CCCC______NNNN_UUUU_MMMM_EEEE_RRRR_IIII_CCCC read from the _f_i_l_e.
  105.      This output file must be copied to the _////_uuuu_ssss_rrrr_////_llll_iiii_bbbb_////_llll_oooo_cccc_aaaa_llll_eeee_////_l_o_c_a_l_e_////_LLLL_CCCC______NNNN_UUUU_MMMM_EEEE_RRRR_IIII_CCCC
  106.      file by someone who is super-user or a member of group _bbbb_iiii_nnnn.  This file
  107.      must be readable by user, group, and other; no other permissions should
  108.      be set.  To use the numeric editing information on this file, set the
  109.      _LLLL_CCCC______NNNN_UUUU_MMMM_EEEE_RRRR_IIII_CCCC category of _ssss_eeee_tttt_llll_oooo_cccc_aaaa_llll_eeee appropriately.
  110.  
  111.      If no input file is given, or if the argument _---- is encountered, _wwww_cccc_hhhh_rrrr_tttt_bbbb_llll
  112.      reads from standard input.
  113.  
  114.      The syntax of _f_i_l_e allows the user to define the name of the data file
  115.      created by _wwww_cccc_hhhh_rrrr_tttt_bbbb_llll, the assignment of characters to character
  116.      classifications, the relationship between conversion letters, and byte
  117.      and screen widths for up to three supplementary code sets.  The keywords
  118.      recognized by _wwww_cccc_hhhh_rrrr_tttt_bbbb_llll are:
  119.  
  120.      _LLLL_CCCC______CCCC_TTTT_YYYY_PPPP_EEEE                name of the first data file to be created by
  121.                              wchrtbl
  122.  
  123.      _iiii_ssss_uuuu_pppp_pppp_eeee_rrrr                 character codes to be classified as uppercase
  124.                              letters
  125.  
  126.  
  127.  
  128.  
  129.                                                                         PPPPaaaaggggeeee 2222
  130.  
  131.  
  132.  
  133.  
  134.  
  135.  
  136. wwwwcccchhhhrrrrttttbbbbllll((((1111MMMM))))                                                        wwwwcccchhhhrrrrttttbbbbllll((((1111MMMM))))
  137.  
  138.  
  139.  
  140.      _iiii_ssss_llll_oooo_wwww_eeee_rrrr                 character codes to be classified as lowercase
  141.                              letters
  142.  
  143.      _iiii_ssss_aaaa_llll_pppp_hhhh_aaaa                 character codes to be classified as letters
  144.  
  145.      _iiii_ssss_dddd_iiii_gggg_iiii_tttt                 character codes to be classified as numeric
  146.                              digits
  147.  
  148.      _iiii_ssss_ssss_pppp_aaaa_cccc_eeee                 character codes to be classified as white-space
  149.                              (delimiter) characters
  150.  
  151.      _iiii_ssss_pppp_uuuu_nnnn_cccc_tttt                 character codes to be classified as punctuation
  152.                              characters
  153.  
  154.      _iiii_ssss_cccc_nnnn_tttt_rrrr_llll                 character codes to be classified as control
  155.                              characters
  156.  
  157.      _iiii_ssss_bbbb_llll_aaaa_nnnn_kkkk                 character codes to be classified as blank
  158.                              characters
  159.  
  160.      _iiii_ssss_pppp_rrrr_iiii_nnnn_tttt                 character codes to be classified as printing
  161.                              characters, including the space character
  162.  
  163.      _iiii_ssss_gggg_rrrr_aaaa_pppp_hhhh                 character codes to be classified as printable
  164.                              characters, not including the space character
  165.  
  166.  
  167.      _iiii_ssss_xxxx_dddd_iiii_gggg_iiii_tttt                character codes to be classified as hexadecimal
  168.                              digits
  169.  
  170.      _uuuu_llll                      relationship between conversion characters
  171.  
  172.      _cccc_ssss_wwww_iiii_dddd_tttt_hhhh                 byte and screen width information
  173.  
  174.      _LLLL_CCCC______NNNN_UUUU_MMMM_EEEE_RRRR_IIII_CCCC              name of the second data file created by _wwww_cccc_hhhh_rrrr_tttt_bbbb_llll
  175.  
  176.      _dddd_eeee_cccc_iiii_mmmm_aaaa_llll______pppp_oooo_iiii_nnnn_tttt           decimal delimiters
  177.  
  178.      _tttt_hhhh_oooo_uuuu_ssss_aaaa_nnnn_dddd_ssss______ssss_eeee_pppp           thousands delimiters
  179.  
  180.      _gggg_rrrr_oooo_uuuu_pppp_iiii_nnnn_gggg                string in which each element is taken as an
  181.                              integer that indicates the number of digits that
  182.                              comprise the current group in a formatted non-
  183.                              monetary numeric quantity.
  184.  
  185.      _LLLL_CCCC______CCCC_TTTT_YYYY_PPPP_EEEE_1111               specify that functions for specification of
  186.                              supplementary code set 1 follows
  187.  
  188.      _LLLL_CCCC______CCCC_TTTT_YYYY_PPPP_EEEE_2222               specify that functions for specification of
  189.                              supplementary code set 2 follows
  190.  
  191.  
  192.  
  193.  
  194.  
  195.                                                                         PPPPaaaaggggeeee 3333
  196.  
  197.  
  198.  
  199.  
  200.  
  201.  
  202. wwwwcccchhhhrrrrttttbbbbllll((((1111MMMM))))                                                        wwwwcccchhhhrrrrttttbbbbllll((((1111MMMM))))
  203.  
  204.  
  205.  
  206.      _LLLL_CCCC______CCCC_TTTT_YYYY_PPPP_EEEE_3333               specify that functions for specification of
  207.                              supplementary code set 3 follows
  208.  
  209.      _iiii_ssss_pppp_hhhh_oooo_nnnn_oooo_gggg_rrrr_aaaa_mmmm_((((_iiii_ssss_wwww_cccc_hhhh_aaaa_rrrr_1111_))))   character codes to be classified as phonograms in
  210.                              supplementary code sets
  211.  
  212.      _iiii_ssss_iiii_dddd_eeee_oooo_gggg_rrrr_aaaa_mmmm_((((_iiii_ssss_wwww_cccc_hhhh_aaaa_rrrr_2222_))))    character codes to be classified as ideograms in
  213.                              supplementary code sets
  214.  
  215.      _iiii_ssss_eeee_nnnn_gggg_llll_iiii_ssss_hhhh_((((_iiii_ssss_wwww_cccc_hhhh_aaaa_rrrr_3333_))))     character codes to be classified as English
  216.                              letters in supplementary code sets
  217.  
  218.      _iiii_ssss_nnnn_uuuu_mmmm_bbbb_eeee_rrrr_((((_iiii_ssss_wwww_cccc_hhhh_aaaa_rrrr_4444_))))      character codes to be classified as numeric in
  219.                              supplementary code sets
  220.  
  221.      _iiii_ssss_ssss_pppp_eeee_cccc_iiii_aaaa_llll_((((_iiii_ssss_wwww_cccc_hhhh_aaaa_rrrr_5555_))))     character codes to be classified as special
  222.                              letters in supplementary code sets
  223.  
  224.      _iiii_ssss_wwww_cccc_hhhh_aaaa_rrrr_6666                character codes to be classified as other
  225.                              printable letters in supplementary code sets
  226.  
  227.      _iiii_ssss_wwww_cccc_hhhh_aaaa_rrrr_9999 - _iiii_ssss_wwww_cccc_hhhh_aaaa_rrrr_2222_2222    character codes to be classified as language-
  228.                              dependent letters/characters
  229.  
  230.      The keywords _iiii_ssss_wwww_cccc_hhhh_aaaa_rrrr_1111 through _iiii_ssss_wwww_cccc_hhhh_aaaa_rrrr_2222_2222 correspond to bit names ______EEEE_1111
  231.      through ______EEEE_2222_2222 defined in _wwww_cccc_tttt_yyyy_pppp_eeee_...._hhhh
  232.  
  233.      Any lines with the number sign (_####) in the first column are treated as
  234.      comments and are ignored.  Blank lines are also ignored.
  235.  
  236.      Characters for _iiii_ssss_uuuu_pppp_pppp_eeee_rrrr, _iiii_ssss_llll_oooo_wwww_eeee_rrrr, _iiii_ssss_aaaa_llll_pppp_hhhh_aaaa, _iiii_ssss_dddd_iiii_gggg_iiii_tttt, _iiii_ssss_ssss_pppp_aaaa_cccc_eeee, _iiii_ssss_pppp_uuuu_nnnn_cccc_tttt,
  237.      _iiii_ssss_cccc_nnnn_tttt_rrrr_llll, _iiii_ssss_bbbb_llll_aaaa_nnnn_kkkk, _iiii_ssss_pppp_rrrr_iiii_nnnn_tttt, _iiii_ssss_gggg_rrrr_aaaa_pppp_hhhh, _iiii_ssss_xxxx_dddd_iiii_gggg_iiii_tttt, _uuuu_llll, _iiii_ssss_pppp_hhhh_oooo_nnnn_oooo_gggg_rrrr_aaaa_mmmm,
  238.      _iiii_ssss_iiii_dddd_eeee_oooo_gggg_rrrr_aaaa_mmmm, _iiii_ssss_eeee_nnnn_gggg_llll_iiii_ssss_hhhh, _iiii_ssss_nnnn_uuuu_mmmm_bbbb_eeee_rrrr, _iiii_ssss_ssss_pppp_eeee_cccc_iiii_aaaa_llll, _iiii_ssss_wwww_cccc_hhhh_aaaa_rrrr_1111-_iiii_ssss_wwww_cccc_hhhh_aaaa_rrrr_6666 and
  239.      _iiii_ssss_wwww_cccc_hhhh_aaaa_rrrr_9999-_iiii_ssss_wwww_cccc_hhhh_aaaa_rrrr_2222_2222 can be represented as hexadecimal or octal constants
  240.      (for example, the letter _aaaa can be represented as _0000_xxxx_6666_1111 in hexadecimal or
  241.      _0000_1111_4444_1111 in octal) and must be up to two byte process codes.  Hexadecimal and
  242.      octal constants may be separated by one or more space and tab characters.
  243.  
  244.      The following is the format of an input specification for _cccc_ssss_wwww_iiii_dddd_tttt_hhhh (byte
  245.      widths for supplementary code sets 2 and 3 are exclusive of the single
  246.      shift characters):
  247.  
  248.           _cccc_ssss_wwww_iiii_dddd_tttt_hhhh _nnnn_1111_[[[[_[[[[_::::_ssss_1111_]]]]_[[[[_,,,,_nnnn_2222_[[[[_::::_ssss_2222_]]]]_[[[[_,,,,_nnnn_3333_[[[[_::::_ssss_3333_]]]]_]]]]_]]]]_]]]]
  249.  
  250.      where,
  251.      _nnnn_1111   byte width for supplementary code set 1
  252.      _ssss_1111   screen width for supplementary code set 1
  253.      _nnnn_2222   byte width for supplementary code set 2
  254.      _ssss_2222   screen width for supplementary code set 2
  255.      _nnnn_3333   byte width for supplementary code set 3
  256.      _ssss_3333   screen width for supplementary code set 3
  257.  
  258.  
  259.                                                                         PPPPaaaaggggeeee 4444
  260.  
  261.  
  262.  
  263.  
  264.  
  265.  
  266. wwwwcccchhhhrrrrttttbbbbllll((((1111MMMM))))                                                        wwwwcccchhhhrrrrttttbbbbllll((((1111MMMM))))
  267.  
  268.  
  269.  
  270.      The dash character (_----) may be used to indicate a range of consecutive
  271.      numbers (inclusive of the characters delimiting the range).  Zero or more
  272.      space characters may be used for separating the dash character from the
  273.      numbers.
  274.  
  275.      The backslash character (_\\\\) is used for line continuation.  Only a
  276.      carriage return is permitted after the backslash character.
  277.  
  278.      The relationship between conversion letters (_uuuu_llll) is expressed as ordered
  279.      pairs of octal or hexadecimal constants:  <_c_o_n_v_e_r_t_i_n_g-_c_h_a_r_a_c_t_e_r
  280.      _c_o_n_v_e_r_t_e_d-_c_h_a_r_a_c_t_e_r>.  These two constants must be up to two byte process
  281.      codes and may be separated by one or more space characters.  Zero or more
  282.      space characters may be used for separating the angle brackets (< >) from
  283.      the numbers.
  284.  
  285. EEEEXXXXAAAAMMMMPPPPLLLLEEEE
  286.      The following is an example of an input file used to create the JAPAN
  287.      code set definition table on a file named _LLLL_CCCC______CCCC_TTTT_YYYY_PPPP_EEEE and _LLLL_CCCC______NNNN_UUUU_MMMM_EEEE_RRRR_IIII_CCCC.
  288.  
  289.           #
  290.           # locale JAPAN
  291.           #
  292.           LC_CTYPE  LC_CTYPE
  293.           #
  294.           # specification for single byte characters
  295.           #
  296.           isupper     0x41 - 0x5a
  297.           islower     0x61 - 0x7a
  298.           isdigit     0x30 - 0x39
  299.           isspace     0x20   0x9 - 0xd
  300.           ispunct     0x21 - 0x2f  0x3a - 0x40  \
  301.                       0x5b - 0x60  0x7b - 0x7e
  302.           iscntrl     0x0 - 0x1f   0x7f - 0x9f
  303.           isblank     0x9 0x20
  304.           isprint   0x20
  305.           isxdigit  0x30 - 0x39    0x61 - 0x66    0x41 - 0x46
  306.           ul        <0x41 0x61> <0x42 0x62> <0x43 0x63> \
  307.                     <0x44 0x64> <0x45 0x65> <0x46 0x66> \
  308.                     <0x47 0x67> <0x48 0x68> <0x49 0x69> \
  309.                     <0x4a 0x6a> <0x4b 0x6b> <0x4c 0x6c> \
  310.                     <0x4d 0x6d> <0x4e 0x6e> <0x4f 0x6f> \
  311.                     <0x50 0x70> <0x51 0x71> <0x52 0x72> \
  312.                     <0x53 0x73> <0x54 0x74> <0x55 0x75> \
  313.                     <0x56 0x76> <0x57 0x77> <0x58 0x78> \
  314.                     <0x59 0x79> <0x5a 0x7a>
  315.           cswidth        2:2,1:1,2:2
  316.           LC_NUMERIC     LC_NUMERIC
  317.           decimal_point  .
  318.           thousands_sep
  319.  
  320.  
  321.  
  322.  
  323.  
  324.  
  325.                                                                         PPPPaaaaggggeeee 5555
  326.  
  327.  
  328.  
  329.  
  330.  
  331.  
  332. wwwwcccchhhhrrrrttttbbbbllll((((1111MMMM))))                                                        wwwwcccchhhhrrrrttttbbbbllll((((1111MMMM))))
  333.  
  334.  
  335.  
  336.           #
  337.           # specification for supplementary code set 1
  338.           #
  339.           LC_CTYPE1
  340.           isupper        0xa3c1 - 0xa3da
  341.           islower        0xa3e1 - 0xa3fa
  342.           isdigit        0xa3b0 - 0xa3b9
  343.           isspace        0xa1a1
  344.           isphonogram    0xa4a1 - 0xa4f3 0xa5a1 - 0xa5f6
  345.           isideogram     0xb0a1 - 0xb0fe 0xb1a1 - 0xb1fe 0xb2a1 - 0xb2fe \
  346.                     0xb3a1 - 0xb3fe 0xb4a1 - 0xb4fe 0xb5a1 - 0xb5fe \
  347.                     0xb6a1 - 0xb6fe 0xb7a1 - 0xb7fe 0xb8a1 - 0xb8fe \
  348.                     0xb9a1 - 0xb9fe 0xbaa1 - 0xbafe 0xbba1 - 0xbbfe \
  349.                     0xbca1 - 0xbcfe 0xbda1 - 0xbdfe 0xbea1 - 0xbefe \
  350.                     0xbfa1 - 0xbffe 0xc0a1 - 0xc0fe 0xc1a1 - 0xc1fe \
  351.                     0xc2a1 - 0xc2fe 0xc3a1 - 0xc3fe 0xc4a1 - 0xc4fe \
  352.                     0xc5a1 - 0xc5fe 0xc6a1 - 0xc6fe 0xc7a1 - 0xc7fe \
  353.                     0xcca1 - 0xccfe 0xcda1 - 0xcdfe 0xcea1 - 0xcefe \
  354.                     0xcfa1 - 0xcffe 0xd0a1 - 0xd0fe 0xd1a1 - 0xd1fe \
  355.                     0xd2a1 - 0xd2fe 0xd3a1 - 0xd3fe 0xd4a1 - 0xd4fe \
  356.                     0xd5a1 - 0xd5fe 0xd6a1 - 0xd6fe 0xd7a1 - 0xd7fe \
  357.                     0xd8a1 - 0xd8fe 0xd9a1 - 0xd9fe 0xdaa1 - 0xdafe \
  358.                     0xdba1 - 0xdbfe 0xdca1 - 0xdcfe 0xdda1 - 0xddfe \
  359.                     0xdea1 - 0xdefe 0xdfa1 - 0xdffe 0xe0a1 - 0xe0fe \
  360.                     0xe1a1 - 0xe1fe 0xe2a1 - 0xe2fe 0xe3a1 - 0xe3fe \
  361.                     0xe4a1 - 0xe4fe 0xe5a1 - 0xe5fe 0xe6a1 - 0xe6fe \
  362.                     0xe7a1 - 0xe7fe 0xe8a1 - 0xe8fe 0xe9a1 - 0xe9fe \
  363.                     0xeaa1 - 0xeafe 0xeba1 - 0xebfe 0xeca1 - 0xecfe \
  364.                     0xeda1 - 0xedfe 0xeea1 - 0xeefe 0xefa1 - 0xeffe \
  365.                     0xf0a1 - 0xf0fe 0xf1a1 - 0xf1fe 0xf2a1 - 0xf2fe \
  366.                     0xf3a1 - 0xf3fe 0xf4a1 - 0xf4fe 0xf5a1 - 0xf5fe \
  367.                     0xf6a1 - 0xf6fe 0xf7a1 - 0xf7fe 0xf8a1 - 0xf8fe \
  368.                     0xf9a1 - 0xf9fe 0xfaa1 - 0xfafe 0xfba1 - 0xfbfe \
  369.                     0xfca1 - 0xfcfe 0xfda1 - 0xfdfe 0xfea1 - 0xfefe
  370.           isenglish 0xa3c1 - 0xa3da 0xa3e1 - 0xa3fa
  371.           isnumber  0xa3b0 - 0xa3b9
  372.           isspecial 0xa1a2 - 0xa1fe 0xa2a1 - 0xa2ae 0xa2ba - 0xa2c1 \
  373.                     0xa2ca - 0xa2d0 0xa2dc - 0xa2ea 0xa2f2 - 0xa2f9 \
  374.                     0xa2fe
  375.           iswchar6  0xa6a1 - 0xa6b8 0xa6c1 - 0xa6d8 0xa7a1 - 0xa7c1 \
  376.                     0xa7d1 - 0xa7f1
  377.           #
  378.           #         JIS X0208 whole code set
  379.           #
  380.           iswchar9  0xa1a1 - 0xa1fe 0xa2a1 - 0xa2fe 0xa3a1 - 0xa3fe \
  381.                     0xa4a1 - 0xa4fe 0xa5a1 - 0xa5fe 0xa6a1 - 0xa6fe \
  382.                     0xa7a1 - 0xa7fe 0xa8a1 - 0xa8fe 0xa9a1 - 0xa9fe \
  383.                     0xaaa1 - 0xaafe 0xaba1 - 0xabfe 0xaca1 - 0xacfe \
  384.                     0xada1 - 0xadfe 0xaea1 - 0xaefe 0xafa1 - 0xaffe \
  385.                     0xb0a1 - 0xb0fe 0xb1a1 - 0xb1fe 0xb2a1 - 0xb2fe \
  386.                     0xb3a1 - 0xb3fe 0xb4a1 - 0xb4fe 0xb5a1 - 0xb5fe \
  387.                     0xb6a1 - 0xb6fe 0xb7a1 - 0xb7fe 0xb8a1 - 0xb8fe \
  388.  
  389.  
  390.  
  391.                                                                         PPPPaaaaggggeeee 6666
  392.  
  393.  
  394.  
  395.  
  396.  
  397.  
  398. wwwwcccchhhhrrrrttttbbbbllll((((1111MMMM))))                                                        wwwwcccchhhhrrrrttttbbbbllll((((1111MMMM))))
  399.  
  400.  
  401.  
  402.                0xb9a1 - 0xb9fe 0xbaa1 - 0xbafe 0xbba1 - 0xbbfe \
  403.                     0xbca1 - 0xbcfe 0xbda1 - 0xbdfe 0xbea1 - 0xbefe \
  404.                     0xbfa1 - 0xbffe 0xc0a1 - 0xc0fe 0xc1a1 - 0xc1fe \
  405.                     0xc2a1 - 0xc2fe 0xc3a1 - 0xc3fe 0xc4a1 - 0xc4fe \
  406.                     0xc5a1 - 0xc5fe 0xc6a1 - 0xc6fe 0xc7a1 - 0xc7fe \
  407.                     0xc8a1 - 0xc8fe 0xc9a1 - 0xc9fe 0xcaa1 - 0xcafe \
  408.                     0xcba1 - 0xcbfe 0xcca1 - 0xccfe 0xcda1 - 0xcdfe \
  409.                     0xcea1 - 0xcefe 0xcfa1 - 0xcffe 0xd0a1 - 0xd0fe \
  410.                     0xd1a1 - 0xd1fe 0xd2a1 - 0xd2fe 0xd3a1 - 0xd3fe \
  411.                     0xd4a1 - 0xd4fe 0xd5a1 - 0xd5fe 0xd6a1 - 0xd6fe \
  412.                     0xd7a1 - 0xd7fe 0xd8a1 - 0xd8fe 0xd9a1 - 0xd9fe \
  413.                     0xdaa1 - 0xdafe 0xdba1 - 0xdbfe 0xdca1 - 0xdcfe \
  414.                     0xdda1 - 0xddfe 0xdea1 - 0xdefe 0xdfa1 - 0xdffe \
  415.                     0xe0a1 - 0xe0fe 0xe1a1 - 0xe1fe 0xe2a1 - 0xe2fe \
  416.                     0xe3a1 - 0xe3fe 0xe4a1 - 0xe4fe 0xe5a1 - 0xe5fe \
  417.                     0xe6a1 - 0xe6fe 0xe7a1 - 0xe7fe 0xe8a1 - 0xe8fe \
  418.                     0xe9a1 - 0xe9fe 0xeaa1 - 0xeafe 0xeba1 - 0xebfe \
  419.                     0xeca1 - 0xecfe 0xeda1 - 0xedfe 0xeea1 - 0xeefe \
  420.                     0xefa1 - 0xeffe 0xf0a1 - 0xf0fe 0xf1a1 - 0xf1fe \
  421.                     0xf2a1 - 0xf2fe 0xf3a1 - 0xf3fe 0xf4a1 - 0xf4fe \
  422.                     0xf5a1 - 0xf5fe 0xf6a1 - 0xf6fe 0xf7a1 - 0xf7fe \
  423.                     0xf8a1 - 0xf8fe 0xf9a1 - 0xf9fe 0xfaa1 - 0xfafe \
  424.                     0xfba1 - 0xfbfe 0xfca1 - 0xfcfe 0xfda1 - 0xfdfe \
  425.                     0xfea1 - 0xfefe
  426.           #
  427.           #         JIS X0208 parentheses
  428.           #
  429.           iswchar10 0xa1c6 - 0xa1db
  430.           #
  431.           #         JIS X0208 hiragana
  432.           #
  433.           iswchar11 0xa4a1 - 0xa4f3
  434.           #
  435.           #         JIS X0208 katakana
  436.           #
  437.           iswchar12 0xa5a1 - 0xa5f6
  438.           #
  439.           #         JIS X0208 other characters
  440.           #
  441.           iswchar13 0xa6a1 - 0xa6b8 0xa6c1 - 0xa6d8 0xa7a1 - 0xa7c1 \
  442.                     0xa7d1 - 0xa7f1 0xa8a1 - 0xa8bf
  443.           #
  444.           #         English letter translation table
  445.           #
  446.           ul        <0xa3c1 0xa3e1> <0xa3c2 0xa3e2> <0xa3c3 0xa3e3> \
  447.                     <0xa3c4 0xa3e4> <0xa3c5 0xa3e5> <0xa3c6 0xa3e6> \
  448.                     <0xa3c7 0xa3e7> <0xa3c8 0xa3e8> <0xa3c9 0xa3e9> \
  449.                     <0xa3ca 0xa3ea> <0xa3cb 0xa3eb> <0xa3cc 0xa3ec> \
  450.                     <0xa3cd 0xa3ed> <0xa3ce 0xa3ee> <0xa3cf 0xa3ef> \
  451.                     <0xa3d0 0xa3f0> <0xa3d1 0xa3f1> <0xa3d2 0xa3f2> \
  452.                     <0xa3d3 0xa3f3> <0xa3d4 0xa3f4> <0xa3d5 0xa3f5> \
  453.                     <0xa3d6 0xa3f6> <0xa3d7 0xa3f7> <0xa3d8 0xa3f8> \
  454.  
  455.  
  456.  
  457.                                                                         PPPPaaaaggggeeee 7777
  458.  
  459.  
  460.  
  461.  
  462.  
  463.  
  464. wwwwcccchhhhrrrrttttbbbbllll((((1111MMMM))))                                                        wwwwcccchhhhrrrrttttbbbbllll((((1111MMMM))))
  465.  
  466.  
  467.  
  468.                <0xa3d9 0xa3f9> <0xa3da 0xa3fa> \
  469.           #
  470.           #         kana translation table
  471.           #
  472.                     <0xa4a1 0xa5a1> <0xa4a2 0xa5a2> <0xa4a3 0xa5a3> \
  473.                     <0xa4a4 0xa5a4> <0xa4a5 0xa5a5> <0xa4a6 0xa5a6> \
  474.                     <0xa4a7 0xa5a7> <0xa4a8 0xa5a8> <0xa4a9 0xa5a9> \
  475.                     <0xa4aa 0xa5aa> <0xa4ab 0xa5ab> <0xa4ac 0xa5ac> \
  476.                     <0xa4ad 0xa5ad> <0xa4ae 0xa5ae> <0xa4af 0xa5af> \
  477.                     <0xa4b0 0xa5b0> <0xa4b1 0xa5b1> <0xa4b2 0xa5b2> \
  478.                     <0xa4b3 0xa5b3> <0xa4b4 0xa5b4> <0xa4b5 0xa5b5> \
  479.                     <0xa4b6 0xa5b6> <0xa4b7 0xa5b7> <0xa4b8 0xa5b8> \
  480.                     <0xa4b9 0xa5b9> <0xa4ba 0xa5ba> <0xa4bb 0xa5bb> \
  481.                     <0xa4bc 0xa5bc> <0xa4bd 0xa5bd> <0xa4be 0xa5be> \
  482.                     <0xa4bf 0xa5bf> <0xa4c0 0xa5c0> <0xa4c1 0xa5c1> \
  483.                     <0xa4c2 0xa5c2> <0xa4c3 0xa5c3> <0xa4c4 0xa5c4> \
  484.                     <0xa4c5 0xa5c5> <0xa4c6 0xa5c6> <0xa4c7 0xa5c7> \
  485.                     <0xa4c8 0xa5c8> <0xa4c9 0xa5c9> <0xa4ca 0xa5ca> \
  486.                     <0xa4cb 0xa5cb> <0xa4cc 0xa5cc> <0xa4cd 0xa5cd> \
  487.                     <0xa4ce 0xa5ce> <0xa4cf 0xa5cf> <0xa4d0 0xa5d0> \
  488.                     <0xa4d1 0xa5d1> <0xa4d2 0xa5d2> <0xa4d3 0xa5d3> \
  489.                     <0xa4d4 0xa5d4> <0xa4d5 0xa5d5> <0xa4d6 0xa5d6> \
  490.                     <0xa4d7 0xa5d7> <0xa4d8 0xa5d8> <0xa4d9 0xa5d9> \
  491.                     <0xa4da 0xa5da> <0xa4db 0xa5db> <0xa4dc 0xa5dc> \
  492.                     <0xa4dd 0xa5dd> <0xa4de 0xa5de> <0xa4df 0xa5df> \
  493.                     <0xa4e0 0xa5e0> <0xa4e1 0xa5e1> <0xa4e2 0xa5e2> \
  494.                     <0xa4e3 0xa5e3> <0xa4e4 0xa5e4> <0xa4e5 0xa5e5> \
  495.                     <0xa4e6 0xa5e6> <0xa4e7 0xa5e7> <0xa4e8 0xa5e8> \
  496.                     <0xa4e9 0xa5e9> <0xa4ea 0xa5ea> <0xa4eb 0xa5eb> \
  497.                     <0xa4ec 0xa5ec> <0xa4ed 0xa5ed> <0xa4ee 0xa5ee> \
  498.                     <0xa4ef 0xa5ef> <0xa4f0 0xa5f0> <0xa4f1 0xa5f1> \
  499.                     <0xa4f2 0xa5f2> <0xa4f3 0xa5f3>
  500.           #
  501.           # specification for supplementary code set 2
  502.           #
  503.           LC_CTYPE2
  504.           iswchar6  0xa1 - 0xdf
  505.           iswchar14 0xa1 - 0xdf
  506.  
  507. FFFFIIIILLLLEEEESSSS
  508.      _////_uuuu_ssss_rrrr_////_llll_iiii_bbbb_////_llll_oooo_cccc_aaaa_llll_eeee_////_l_o_c_a_l_e_////_LLLL_CCCC______CCCC_TTTT_YYYY_PPPP_EEEE
  509.                      data files containing character classification and
  510.                      conversion tables and character set width information
  511.                      created by _cccc_hhhh_rrrr_tttt_bbbb_llll or _wwww_cccc_hhhh_rrrr_tttt_bbbb_llll.
  512.      _////_uuuu_ssss_rrrr_////_llll_iiii_bbbb_////_llll_oooo_cccc_aaaa_llll_eeee_////_l_o_c_a_l_e_////_LLLL_CCCC______NNNN_UUUU_MMMM_EEEE_RRRR_IIII_CCCC
  513.                      data files containing numeric editing information.
  514.      _////_uuuu_ssss_rrrr_////_iiii_nnnn_cccc_llll_uuuu_dddd_eeee_////_cccc_tttt_yyyy_pppp_eeee_...._hhhh
  515.                      header file containing information used by character
  516.                      classification and conversion routines for single byte
  517.                      characters.
  518.  
  519.  
  520.  
  521.  
  522.  
  523.                                                                         PPPPaaaaggggeeee 8888
  524.  
  525.  
  526.  
  527.  
  528.  
  529.  
  530. wwwwcccchhhhrrrrttttbbbbllll((((1111MMMM))))                                                        wwwwcccchhhhrrrrttttbbbbllll((((1111MMMM))))
  531.  
  532.  
  533.  
  534.      _////_uuuu_ssss_rrrr_////_iiii_nnnn_cccc_llll_uuuu_dddd_eeee_////_wwww_cccc_tttt_yyyy_pppp_eeee_...._hhhh
  535.                      header file containing information used by international
  536.                      character classification and conversion routines for
  537.                      supplementary code sets.
  538.      _////_uuuu_ssss_rrrr_////_iiii_nnnn_cccc_llll_uuuu_dddd_eeee_////_xxxx_cccc_tttt_yyyy_pppp_eeee_...._hhhh
  539.                      header file containing information used by language-
  540.                      dependent character classification and conversion
  541.                      routines for supplementary code sets.
  542.  
  543. SSSSEEEEEEEE AAAALLLLSSSSOOOO
  544.      _cccc_tttt_yyyy_pppp_eeee(3C), _ssss_eeee_tttt_llll_oooo_cccc_aaaa_llll_eeee(3C), _wwww_cccc_tttt_yyyy_pppp_eeee(3W), _eeee_nnnn_vvvv_iiii_rrrr_oooo_nnnn(5)
  545.  
  546. DDDDIIIIAAAAGGGGNNNNOOOOSSSSTTTTIIIICCCCSSSS
  547.      The error messages produced by _wwww_cccc_hhhh_rrrr_tttt_bbbb_llll are intended to be self-
  548.      explanatory.  They indicate errors in the command line or syntactic
  549.      errors encountered within the input file.
  550.  
  551. NNNNOOOOTTTTEEEESSSS
  552.      In IRIX 6.5, the content of the _LLLL_CCCC______CCCC_TTTT_YYYY_PPPP_EEEE locale category was extended to
  553.      comply with the XPG/4 standard.  The older LC_CTYPE binary format will
  554.      not be recognized by the C library.  Therefore, all custom-built locales
  555.      created under an older version of IRIX must be regenerated with the later
  556.      versions of _llll_oooo_cccc_aaaa_llll_eeee_dddd_eeee_ffff(1) and associated _cccc_hhhh_rrrr_tttt_bbbb_llll(1M)/_wwww_cccc_hhhh_rrrr_tttt_bbbb_llll(1M).
  557.  
  558.  
  559.  
  560.  
  561.  
  562.  
  563.  
  564.  
  565.  
  566.  
  567.  
  568.  
  569.  
  570.  
  571.  
  572.  
  573.  
  574.  
  575.  
  576.  
  577.  
  578.  
  579.  
  580.  
  581.  
  582.  
  583.  
  584.  
  585.  
  586.  
  587.  
  588.  
  589.                                                                         PPPPaaaaggggeeee 9999
  590.  
  591.  
  592.  
  593.